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evaluations of performance. Two individuals who 
performed equally well but had dissimilar expectancies 
evaluated their performance after task completion very 
differently: The individual with the higher pre-task 
expectancies was more likely to overestimate his/her 
performance after task completion than the individual 
with low initial expectancies. Thus, the gender difference 
in the accuracy of self-evaluations on masculine tasks 
can be attributed, in part, to their low expectancies which 
depress their post-task self-evaluations (Beyer, 1990, 
1998, 2002; Beyer & Bowden, 1997). 



Abstract 

Gender differences in confidence in three academic areas 
of varying gender-type were assessed. We used questions 
about Chemistry (masculine), English (feminine), and 
Art (neutral). Females underestimated their performance 
more than did males on Chemistry and English questions. 



Self-evaluation of performance is a fundamental 
task of self-regulation. However, people use some 
latitude when evaluating their abilities, making self- 
evaluations of performance at least somewhat inaccurate 
for many people (for a review see Beyer, 1998). 

Recently there has been an attempt to discover the 
moderating and mediating variables which affect the 
accuracy of self-evaluations. Beyer’s (1990, 1998, 
1999a, 2002; Beyer & Bowden, 1997) work has focused 
on participants’ gender, the gender-type of the tasks on 
which they are working, and expectancies. 

She compared participants’ posMask self- 
evaluations of performance to their actual performance 
and found that females underestimated their performance 
on tasks participants considered “masculine” (tests of 
knowledge of sports figures, politics, or mathematics). 
Males either overestimated or accurately evaluated their 
performance. No gender differences in the accuracy of 
self-evaluations were found for tests of knowledge of 
movie and TV stars or English (considered feminine 
gender-typed by participants) or neutral tests of common 
knowledge, character detection, practical questions, 
anagrams, or history and geography. 

This gender difference in self-evaluations was 
mediated, in part, by self-consistency: Pre-task 

expectancies of performance affected post-task self- 



The Goals of this Research 

The present research examines gender differences 
in the accuracy of self-evaluations for three subject areas 
of varying gender-type: Chemistry (masculine), English 
(feminine), and Art (neutral). It was hypothesized, in 
accordance with Beyer’s (1990, 1998, 2002; Beyer & 
Bowden, 1997) previous research, that women would 
evaluate their performance less accurately than men in 
Chemistry, but not English and Art. 

Rather than presenting test questions on paper, as 
was done in the past, we used a modified methodology 
for this research. Test questions were presented auditorily 
via a computer. The advantage of this presentation 
mode is that participants’ reaction times to questions 
can be recorded. 

A study of gender differences in performance on 
the game show Jeopardy found that women were less 
likely than men to be able to select question types because 
of their slower response times (Brownlow, Whitener, & 
Rupert, 1998). Although slower reaction times may 
reflect a male advantage in either knowledge or 
psychomotor speed, an alternative interpretation is that 
women’s lower confidence in their ability to answer 
questions correctly manifested itself in slower reaction 
times. We were interested in whether participants’ 
reaction times to test questions would reveal gender 
differences in self-perceptions of ability for different 
subject areas. 

In addition, this research adds to our understanding 
of the self-evaluation process by investigating the effect 
of dysphoria. So far, the relation between gender and 
dysphoria has remained largely unexplored. The present 
research aims to fill this void in the self-perception 
literature. 



* Poster presented at the annual meeting of the American 
Psychological Society in New Orleans, LA, June 2002. 

^ To whom correspondence should be addressed at 
Department of Psychology, University of Wisconsin- 
Parkside, Kenosha, WI 53141. e-mail: beyer@uwp.edu 



Depressive Realism 

According to work on depressive realism, mildly 
depressed individuals are more accurate self-perceivers 
than their overly optimistic nondysphoric counterparts. 
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However, research on depressive realism has ignored 
gender differences in the accuracy of self-evaluations. 
An innovation of the present research is that it explores 
the interrelation between gender and dysphoria in terms 
of the accuracy of self-evaluations. 

Dysphorics are more accurate than riondysphorics 
in their self-evaluations of social competence 
(Lewinsohn, Mischel, Chaplin, & Barton, 1980; 
McNamara & Hackett, 1986), recall of their toddlers’ 
negative behaviors (Lovejoy, 1991), predictions about 
the future (Alloy & Ahrens, 1987; Crocker, Alloy, & 
Kayne, 1988), assessments of the degree of control over 
external stimuli (Abramson & Alloy, 1981; Alloy & 
Abramson, 1979, 1982; Alloy, Abramson, & Kossman, 
1985; Alloy, Abramson, & Viscusi, 1981; Dobson & 
Tranche, 1989; Glass, McKnight, & Valdimarsdottir, 
1993; Martin, Abramson, & Alloy, 1984), and they are 
more sensitive to changes in reward contingencies 
(Rosenfarb, Burker, Morris, & Cush, 1993). 

However, research on depressive realism has been 
criticized for failing to use an objective standard for 
accuracy (Ackermann & DeRubeis, 1991). Furthermore, 
the depressive realism hypothesis has been challenged 
by a substantial number of studies which found no 
evidence for its existence (Benassi & Mahler, 1985; 
Buchwald, 1977; Campbell & Fehr, 1990; DeMonbreun 
& Craighead, 1977; Dobson & Shaw, 1981; Gotlib, 1981, 
1983; Vazquez, 1987; Wener & Rehm, 1975). For 
example, dysphorics’ thoughts about themselves are 
quite negative and their predictions about the future 
more pessimistic than those of nondysphorics (e.g., 
Pietromonaco & Markus, 1985). But this could be a 
function of dysphorics’ more negative past experiences 
rather than representing self-derogation or inaccurate 
self-perceptions (Koenig, Ragin, & Harrow, 1995; 
Shrauger, Mariano, & Walter, 1998). Thus, the question 
is: Are dysphorics’ negative self-perceptions accurate? 
Or are they, like females’ self-evaluations on masculine 
tasks, underestimations of their abilities, popularity, etc.? 

This research fills a gap insofar as it links the separate 
literatures on gender and self-evaluations on the one 
hand and dysphoria and self-perceptions on the other 
hand. Such research may provide both theoretical 
insights and be of practical value for intervention 
programs. The fact that the literature on depressive 
realism has paid so little attention to gender differences 
in the accuracy of self-perceptions is surprising given 
that women are twice as likely to suffer from depression 
as men (Nolen-Hoeksema & Girgus, 1994). Therefore, 
according to the depressive realism hypothesis, women 
should be more accurate self-evaluators than men. 



However, this prediction has rarely been explicitly tested. 

Hypotheses 

To summarize, this study addresses the following 
hypotheses. (1) Gender differences in the accuracy of 
self-evaluations of performance will be found for a 
“masculine” subject area (Chemistry) but not for 
feminine or neutral subjects (English and Art). (2) 
Dysphorics will not be more accurate self-evaluators 
than nondysphorics. (3) Gender differences in reaction 
times to different subject areas will be found even when 
knowledge of the subject area is controlled. This would 
reveal gender differences in self-perceptions of ability 
in different subject areas. 

Method 

Participants 

Participants were 361 General Psychology students 
(229 females and 132 males). Participants received 
course credit for their participation. 

Materials 

Participants were presented with 90 True/False (T/F) 
questions from three subject areas. Thirty questions each 
pertained to Chemistry (masculine domain), English 
grammar (feminine domain), and Art (neutral domain). 
The three subjects were interspersed across the 90 
questions. A pretest of a greater pool of questions was 
used to select the final 30 questions for each subject. 
This yielded three tests of roughly equal difficulty. 

Procedure 

Participants filled out the Beck Depression 
Inventory (BDI; Beck, Ward, Mendelson, Mock, & 
Erbaugh, 1961) and were presented with a general 
description of the task including information on the 
number of T/F questions, and the amount of time 
available. Participants were informed that their reaction 
times would be recorded. Participants were run through 
two sample questions for each subject to familiarize 
them with the procedure. 

Participants were run in groups of two on Macintosh 
SE computers. The T/F questions were auditorily 
presented via computer to avoid individual differences 
in reading speed. Prior to each question the prompt 
“READY?” appeared. Participants had to press a key 
to activate the presentation of the next question. This 
ensured that participants were ready to listen to the next 
question. Half of the participants listened to a female, 
the other half to a male voice. Participants pressed one 
of two keys to indicate whether they believed the answer 
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to the question was True or False (for half of the 
participants the T/F keys were reversed). Approximately 
half of the participants were informed prior to each 
question which of the three subjects would be presented 
next, the other half of the participants were unaware of 
the subject area of the next question^. Response times 
were recorded by the computer. 

The questions were broken into three blocks. Prior 
to each block of 30 T/F questions, participants were 
asked how many questions from each domain they 
expected to answer correctly (expectancies). After each 
block, participants were asked to estimate how many 
questions from each domain they had answered correctly 
(self-evaluations of performance). After answering all 
90 questions, participants estimated how many questions 
of each type they answered correctly. 

Results 

All significance tests are two-tailed. 

Gender Differences in Expectancies, Performance, and 
Self-evaluations 

Males answered significantly more Chemistry 
questions correctly, F(l, 358) = 5.14,/? < .03, expected 
to answer more questions, F(l, 318) = 7.68, p < .006, 
and thought they had answered more questions than did 
females, F(l, 353) = 18.05, p < .0001. For English 
there was neither a gender difference in performance, 
F(l, 352) = 2.24, p < .14, nor a gender difference in 
expectancies, F(l, 311) < 1, but males thought they had 
answered more questions than did females to a marginal 
degree, F(l, 339) = 3.46, p < .07. For Art there was 
neither a gender difference in performance, F(l, 355) = 
2.61, p < . 11 , nor a gender difference in expectancies, 
F(l, 313) < 1, but males thought they had answered 
more questions than did females, F(l, 349) = 4.16, p < 
.05. 

Dysphoria Status Differences in Accuracy of Self- 
evaluations 

Hypothesis 1 predicts that gender differences in 
the accuracy of self-evaluations will be significant only 
for Chemistry. Hypothesis 2 predicts that dysphorics 
are not more accurate self-evaluators than nondysphorics. 

Accuracy of self-evaluations was calculated by 
subtracting performance from post-test self-evaluation 
scores. Positive accuracy scores indicate overestimations 
of performance, negative scores underestimations, while 

^ Because this variable had no consistent effect on the 
dependent variables and did not interact with gender, it will 
not be discussed below. 



scores approaching 0 indicate accurate evaluations. 

For each of the three subject areas, the accuracy of 
self-evaluations was regressed on participant gender, 
score on the BDI (as a continuous variable), and their 
interaction. Table 1 lists the estimated means. 



TABLE 1: Accuracy of Self-evaluations by Gender 
and Dysphoria Status 





Females 


Males 


English Questions Nondysphorics 


-6.2 


-4.9 


Dysphorics 


-7.3 


-6.0 


Art Questions Nondysphorics 


-7.4 


-6.8 


Dysphorics 


-8.1 


-7.4 


Chemistry Questions Nondysphorics 


-6.7 


-4.9 


Dysphorics 


-7.4 


-5.6 



Gender was a significant predictor of the accuracy 
of self-evaluations for Chemistry, F(l, 349) = 9.33, p < 
.002, and English, F(l, 330) = 6.67, p < .01, where 
males underestimated their performance less than did 
females. There was no gender difference in the accuracy 
of Art self-evaluations, F(l, 342) = 1.25, p < .27. 

For Art and Chemistry, BDI score did not predict 
the accuracy of self-evaluations, F(l, 342)< 1; F(l, 349) 
= 1.35,p < .25. Only for English did BDI score predict 
the accuracy of self-evaluations, F(l, 330) = 5.67, p < 
.02. Dysphorics underestimated their English 
performance more than nondysphorics did. Thus, 
contrary to the depressive realism hypothesis, but as 
predicted, dysphorics were not more accurate self- 
evaluators than nondysphorics. 

Gender Differences in Reaction Times 

For each of the three subject areas, reaction times 
were regressed on participant gender, performance score, 
score on the BDI (as a continuous variable), and the 
interaction terms. Performance score was included to 
control for differences in knowledge. 

There was no significant gender difference in 
reaction times for English, F(l, 340) < 1. There were 
borderline significant gender differences in reaction 
times for Art, F(l, 337) = 3.68, p < .06, and Chemistry, 
F(l, 343) = 3.74, p < .06, with females having faster 
reaction times than males. 

Dysphorics had slower reaction times than 
nondysphorics for English, F(l, 340) =7.60, p < .006, 
and Art, F(l, 337) =3.21, p < .08. For Chemistry the 
triple interaction between gender, BDI score and 
performance was significant, F(l, 343) = 7.09, p < 
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.008, with low-performing nondysphoric females having 
the fastest reaction times and high-performing dysphoric 
males the slowest reaction times. 

Discussion 

Gender Differences in the Accuracy of Self-evaluations 
There was a significant gender difference in the 
accuracy of self-evaluations for Chemistry and English 
with males underestimating their performance 
significantly less than females did. The results for 
Chemistry are in line with findings that girls, but not 
boys, underestimate their grades and ability in 
mathematics (Bomholt, 2000; Frome & Eccles, 1998; 
Hannover, 1991; Tiedemann, 2000). Furthermore, the 
gender stereotype regarding males’ superiority in 
chemistry is still pervasive (Beyer, 1999b). This suggests 
that chemistry is still an arena fraught with self-doubt 
for females. This replicates the gender difference in the 
accuracy of self-evaluations for mathematics (Beyer, 
2002) and sports (Beyer, 1990, 1998, 1999a; Beyer & 
Bowden, 1997). The fact that women underestimated 
their performance for English more than men did was 
not anticipated. 

The consequences of inaccurately negative self- 
perceptions have been discussed in detail by Beyer 
(1998). Briefly, positive self-perceptions are related to 
self-esteem and psychological health (e.g., Kurman & 
Sriram, 1997; Taylor & Armor, 1997), whereas negative 
self-evaluations are related to depression (e.g.. Glass et 
al., 1993). Positive self-perceptions are also intimately 
tied to aspirations, preference for challenging tasks, 
curiosity, intrinsic motivation, and persistence and thus 
have desirable effects on performance (see Beyer, 1995 
for an overview; Greene, DeBacker, Ravindran, & 
Krows, 1999). This suggests that the gender difference 
in the accuracy of self-evaluations for Chemistry may 
have deleterious affective and behavioral consequences. 

Depressive Realism 

The depressive realism hypothesis received no 
support from the present research. A significant effect 
of dysphoria was found only for English, where 
dysphorics were less accurate than nondysphorics. A 
similar result was obtained by Beyer (2002). These 
results are in line with research which found that 
dysphorics show biases involving slowed processing of 
positive information and faster processing of negative 
information, whereas nondysphorics possess the reverse 
orientation to the self (Moretti, Segal, McCann, Shaw, 
Miller, & Vella, 1996). In fact, dysphorics tend to 
activate negative self-reference information relatively 



automatically and unintentionally (Bargh & Tota, 1988). 

Nondysphorics have been called “cognitive misers” 
(Fiske & Taylor, 1991) who “process information in 
trivial situations casually, with self-serving optimistic 
biases such as the illusion of control” (Pacini, Muir, & 
Epstein, 1998, p. 1057). Maybe so, but dysphorics are 
cognitive misers as well, processing information with a 
negative bias. 

Thus, the present study adds to a growing body of 
research suggesting problems with the depressive realism 
hypothesis. The question of whether dysphorics or 
nondysphorics are more accurate self-evaluators is too 
simplistic. The reason for the now-you-see-it-now- 
you-don’t pattern of results for depressive realism may 
be that too little attention has been paid to discovering 
moderating variables which affect the accuracy of self- 
perceptions of dysphorics and nondysphorics. For 
example, the depressive realism effect is most likely 
obtained in artificial laboratory situations, whereas 
nondysphorics may be more accurate self-evaluators in 
nontrivial, realistic situations (Dobson & Franche, 1989; 
Dunning & Story, 1991; Pacini et al., 1998). A similar 
challenge to the generality of the depressive realism 
effect was obtained by Shrauger et al. (1998). Dysphorics 
were more accurate than nondysphorics when making 
predictions about undesirable events in their lives, 
whereas the reverse was true for desirable events. In 
all likelihood specifics about the task at hand, such as 
task difficulty, or the relative ease with which one can 
judge one’s performance, are of importance. 

Gender Differences in Reaction Times 

For Art and Chemistry women had faster reaction 
times than men. Thus, women’s low perception of 
performance in Chemistry obtained by using self- 
evaluations of performance was not mirrored by more 
conservative reaction times. This suggests that reaction 
times are not a good measure of confidence. Interestingly, 
dysphoria clearly manifests itself in slowed reaction 
times, even when equating for performance. 

Conclusions 

The present study has replicated the findings of 
Beyer (1990, 1998, 1999a, 2002; Beyer & Bowden, 
1997) in the area of chemistry. The results indicate that 
females harbor self-doubts regarding their chemistry 
ability. 

No evidence for depressive realism was obtained. 
Dysphorics were not more accurate self-evaluators than 
nondysphorics. Because females are twice as likely to 
suffer from depression than males (Nolen-Hoeksema & 




6 



V 

I 



Girgus, 1994), more research on the interrelations among 
gender, depression, and the accuracy of self-perceptions 
is warranted. 
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